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We prove that any polynomial-time ar(n)-approximation algo¬ 
rithm for the n-vertex metric asymmetric Traveling Salesper¬ 
son Problem yields a polynomial-time 0(ar(C))-approximation 
algorithm for the mixed and windy Capacitated Arc Routing 
Problem, where C is the number of weakly connected com¬ 
ponents in the subgraph induced by the positive-demand 
arcs—a small number in many applications. In conjunction 
with known results, we obtain constant-factor approximations 
for C £ 0(log«) and 0(log C/log log C)-approximations in gen¬ 
eral. Experiments show that our algorithm, together with 
several heuristic enhancements, outperforms many previous 
polynomial-time heuristics. Finally, since the solution quality 
achievable in polynomial time appears to mainly depend on C 
and since C = 1 in almost all benchmark instances, we pro¬ 
pose the Ob benchmark set, simulating cities that are divided 
into several components by a river. 
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1 Introduction 

Golden and Wong [25] introduced the Capacitated Arc Routing 
Problem (CARP) in order to model the search for minimum-cost 
routes for vehicles of equal capacity that are initially located in a 
vehicle depot and have to serve all “customer” demands. Appli¬ 
cations of CARP include snow plowing, waste collection, meter 
reading, and newspaper delivery [12]. Herein, the customer 
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demands require that roads of a road network are served. The 
road network is modeled as a graph whose edges represent roads 
and whose vertices can be thought of as road intersections. The 
customer demands are modeled as positive integers assigned to 
edges. Moreover, each edge has a cost for traveling along it. 

Problem 1.1 (Capacitated Arc Routing Problem (CARP)). 
Instance: An undirected graph G - (V, E), a depot vertex vq 6 V, 
travel costs c: £ —> NU [0], edge demands c/: £ ^ NU [0], 
and a vehicle capacity Q. 

Task: Find a set W of closed walks in G, each corresponding 
to the route of one vehicle and passing through the depot 
vertex and hnd a serving function s: W ^ 2^ determining 
for each closed walk w eW the subset s{w) of edges served 
by w such that 

- Swevr c(w) is minimized, where c(w) cie,) for a 

walk w = (ei,e 2 , ...,ee)e £'”, 

- Y.eesiw) d(e) < Q, and 

- each edge e with d{e) > 0 is served by exactly one walk 
in W. 

Note that vehicle routes may traverse each vertex or edge of 
the input graph multiple times. Well-known special cases of 
CARP are the NP-hard Rural Postman Problem (RPP) [32], 
where the vehicle capacity is unbounded and, hence, the goal 
is to hnd a shortest possible route for one vehicle that visits 
all positive-demand edges, and the polynomial-time solvable 
Chinese Postman Problem (CPP) [18, 19], where additionally 
all edges have positive demand. 

1.1 Mixed and windy variants 

CARP is polynomial-time constant-factor approximable [6, 31, 
41]. However, as noted by van Bevem et al. [7, Challenge 5] in 
a recent survey on the computational complexity of arc routing 
problems, the polynomial-time approximability of CARP in 
directed, mixed, and windy graphs is open. Herein, a mixed 
graph may contain directed arcs in addition to undirected edges 
for the purpose of modeling one-way roads or the requirement 
of servicing a road in a specific direction or in both directions. In 
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a windy graph, the cost for traversing an undirected edge {u, v) 
in the direction from m to v may be different from the cost for 
traversing it in the opposite direction (this models sloped roads, 
for example). In this work, we study approximation algorithms 
for mixed and windy variants of CARP. To formally state these 
problems, we need some terminology related to mixed graphs. 

Definition 1.2 (Walks in mixed and windy graphs). A mixed 
graph is a triple G — {V,E,A), where R is a set of vertices, 
E c {{«, v) I M, V e y) is a set of ( undirected) edges, A c y x V is 
a set of (directed) arcs (that might contain loops), and no pair of 
vertices has an arc and an edge between them. The head of an 
arc (m, v) e y X y is v, its tail is u. 

A walk in G is a sequence w - {ai,a 2 ,, ai) such that, for 
each ai — (u, v), 1 < i < G we have (m, v) € A or {u, v) 6 E, 
and such that the tail of a, is the head of a,_i for 1 < / < ^. If 
(m, v) occurs in w, then we say that w traverses the arc (u, v) € A 
or the edge {u, v) 6 E. If the tail of a^ is the head of a{, then we 
call w a closed walk. 

Denoting by c: y x y ^ N U {0, oo) the travel cost between 
vertices of G, the cost of a walk w — (ai,... ,a{) is c(w) : = 
c(a,). The cost of a set W of walks is c(W) Yjwew c(vr). 

We study approximation algorithms for the following problem. 

Problem 1.3 (Mixed and windy CARP (MWCARP)). 
Instance: A mixed graph G - {V, E, A), a depot vertex vo e V, 
travel costs c: y x y ^ N U (0, oo), demands d: £ U A —> 
N U (0), and a vehicle capacity Q. 

Task: Find a minimum-cost set W of closed walks in G, each 
passing through the depot vertex vq, and a serving func¬ 
tion i: Vy ^ 2^^"^ determining for each walk w e W the 
subset i(w) of the edges and arcs it serves such that 

- Ijeesiw) d{e) < Q, and 

- each edge or arc e with d{e) > 0 is served by exactly one 
walk in W. 

For brevity, we use the term “arc” to refer to both undirected 
edges and directed arcs. Besides studying the approximability 
of MWCARP, we also consider the following special cases. 

If the vehicle capacity Q in MWCARP is unlimited (that is, 
larger than the sum of all demands) and the depot vq is incident 
to a positive-demand arc, then one obtains the mixed and windy 
Rural Postman Problem (MWRPP): 

Problem 1.4 (Mixed and windy RPP (MWRPP)). 

Instance: A mixed graph G = (y E,A) with travel costs c: y x 
y —» N U {0, oo) and a set R C £ U A of required arcs. 

Task: Find a minimum-cost closed walk in G traversing all arcs 
inR. 

If, furthermore, E — % 'm MWRPP, then we obtain the directed 
Rural Postman Problem (DRPP) and if R = £ U A, then we 
obtain the mixed Chinese Postman Problem (MCPP). 

1.2 An obstacle: approximating metric asymmetric TSP 

Aiming for good approximate solutions for MWCARP, we have 
to be aware of the strong relation of its special case DRPP to the 
following variant of the Traveling Salesperson Problem (TSP); 


Problem 1.5 (Metric asymmetric TSP (a-ATSP)). 

Instance: A set V of vertices and travel costs c: VxV NU{0) 
satisfying the triangle inequality c(u, v) < c(u, w) + c(w, v) for 
all u, V, w 6 y. 

Task: Find a minimum-cost cycle that visits every vertex in V 
exactly once. 

Already Christofides et al. [11] observed that DRPP is a gen¬ 
eralization of A-ATSP. In fact, DRPP is at least as hard to 
approximate as a-ATSP: Given a a-ATSP instance, one obtains 
an equivalent DRPP instance by simply adding a zero-cost loop 
to each vertex and by adding these loops to the set R of required 
arcs. This leads to the following observation. 

Observation 1.6. Any Q;(«)-approximation for n-vertex DRPP 
yields an Q;(n)-approximation for n-vertex a-ATSP. 

Interestingly, the constant-factor approximability of a-ATSP is a 
long-standing open problem and the G(log n/ log log n)-approx- 
imation by Asadpour et al. [2] from 2010 is the first asymptotic 
improvement over the G(log n)-approximation by Frieze et al. 
[24] from 1982. Thus, the constant-factor approximations for 
(undirected) CARP [6, 31,41] and MCPP [37] cannot be simply 
carried over to MWRPP or MWCARP. 

1.3 Our contributions 

As discussed in Section 1.2, any a'(n)-approximation for n-vertex 
DRPP yields an a(n)-approximation for n-vertex a-ATSP. We 
first contribute the following theorem for the converse direction. 

Theorem 1.7. If n-vertex a-ATSP is a(n)-approximable in 
t{n) time, then 

(i) n-vertex DRPP is (a(C) ■+■ \)-approximable in t(C) -I- 
0(n^ log n) time, 

(ii) n-vertex MWRPP is {a(C) -H 3)-approximable in t(C) -i- 
0{n^ log n) time, and 

(Hi) n-vertex MWCARP is (8a'(C -l- 1) -l- 21)-approximable in 
t(C -H 1) -H 0{n^ logn) time, 

where C is the number of weakly connected components in the 
subgraph induced by the positive-demand arcs and edges. 

The approximation factors in Theorem 1.7(iii) and Corollary 1.8 
below are rather large. Yet in the experiments described in Sec¬ 
tion 5, the relative error of the algorithm was always below 5/4. 

We prove Theorem 1.7(i-ii) in Section 3 and Theorem 1.7(iii) 
in Section 4. Given Theorem 1.7 and Observation 1.6, the 
solution quality achievable in polynomial time appears to mainly 
depend on the number C. The number C is small in several 
applications, for example, when routing street sweepers and 
snow plows. Indeed, we found C = 1 in all but one instance 
of the benchmark sets mval and Ipr of Belenguer et al. [4] 
and egl-large of Brandao and Eglese [10]. This makes the 
following corollary particularly interesting. 

Corollary 1.8. MWCARP is 35-approximable in G(2‘'C^ -i- 
n^ log n) time, that is, constant-factor approximable in polyno¬ 
mial time for C e G(logn). 
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Corollary 1.8 follows from Theorem 1.7 and the exact (9(2"n^)- 
time algorithm for n-vertex A-ATSP by Bellman [5] and Held 
and Karp [30]. It is “tight” in the sense that finding polynomial¬ 
time constant-factor approximations for MWCARP in general 
would, via Observation 1.6, answer a question open since 1982 
and that computing optimal solutions of MWCARP is NP-hard 
even if C = 1 [7]. 

In Section 5, we evaluate our algorithm on the mval, Ipr, and 
egl-large benchmark sets and find that it outperforms many 
previous polynomial-time heuristics. Some instances are solved 
to optimality. Moreover, since we found that the solution quality 
achievable in polynomial time appears to crucially depend on 
the parameter C and almost all of the above benchmark instances 
have C = 1, we propose a method for generating benchmark 
instances that simulate cities separated into few components by 
a river, resulting in the Ob benchmark set. 

1.4 Related work 

Several polynomial-time heuristics for variants of CARP are 
known [4, 10, 25, 34] and, in particular, used for computing 
initial solutions for more time-consuming local search and ge¬ 
netic algorithms [4, 10]. Most heuristics are improved variants 
of three basic approaches: 

Augment and merge heuristics start out with small vehicle 
tours, each serving one positive-demand arc, then succes¬ 
sively grow and merge these tours while maintaining ca¬ 
pacity constraints [25]. 

Path scanning heuristics grow vehicle tours by successively 
augmenting them with the “most promising” positive- 
demand arc [26], for example, by the arc that is closest 
to the previously added arc. 

Route first, ciuster second approaches first construct a gi¬ 
ant tour that visits all positive-demand arcs, which can 
then be split optimally into subsegments satisfying capacity 
constraints [3, 40]. 

The giant tour for the “route first, cluster second” approach 
can be computed heuristically [4, 10], yet when computing it 
using a constant-factor approximation for the undirected RPP, 
one can split it to obtain a constant-factor approximation for 
the undirected CARP [31, 41]. Notably, the “route first, cluster 
second” approach is the only one known to yield solutions of 
guaranteed quality for CARP in polynomial time. One barrier 
for generalizing this result to MWCARP is that already approxi¬ 
mating MWRPP is challenging (see Section 1.2). Indeed, the 
only polynomial-time algorithms with guaranteed solution qual¬ 
ity for arc routing problems in mixed graphs are for variants to 
which Observation 1.6 does not apply since all arcs and edges 
have to be served [15, 37]. 

Our algorithm follows the “route first, cluster second” ap¬ 
proach: We first compute an approximate giant tour using 
Theorem 1.7(ii) and then, analogously to the approximation 
algorithms for undirected CARP [31, 41], split it to obtain The¬ 
orem 1.7(iii). However, since the analyses of the approximation 
factor for undirected CARP rely on symmetric distances be¬ 
tween vertices [31, 41], our analysis is fundamentally different. 
Our experiments show that computing the giant tour using The¬ 
orem 1.7(ii) is beneficial compared to computing it heuristically 
like Belenguer et al. [4] and Brandao and Eglese [10]. 


Notably, the approximation factor of Theorem 1.7 depends 
on the number C of connected components in the graph in¬ 
duced by positive-demand arcs. This number C is small in many 
applications and benchmark data sets, a fact that inspired the de¬ 
velopment of exact exponential-time algorithms for RPP which 
are efficient when C is small [21, 28, 38, 39]. Orloff [35] noticed 
already in 1976 that the number C is a determining factor for 
the computational complexity of RPP. Theorem 1.7 shows that 
it is also a determining factor for the solution quality achievable 
in polynomial time. 

In terms of parameterized complexity theory [14, 17], one can 
interpret Corollary 1.8 as a fixed-parameter constant-factor ap¬ 
proximation algorithm [33] for MWCARP parameterized by C. 

2 Preliminaries 

Although we consider problems on mixed graphs as defined in 
Definition 1.2, in some of our proofs we use more general mixed 
multigraphs G - {V,E,A) with a set V =: V{G) of vertices, a 
multiset E E(G) over ((m, v] | m, v 6 V] of (undirected) edges, 
a multiset A A{G) over V x V of (directed) arcs that may 
contain self-loops, and travel costs c: V x V —> N U (0, oo). If 
E — d), then G is a directed multigraph. 

From Definition 1.2, recall the definition of walks in mixed 
graphs. An Euler tour for G is a closed walk that traverses 
each arc and each edge of G exactly as often as it is present 
in G. A graph is Eulerian if it allows for an Euler tour. Let 
w - (ai,a 2 ,. ■. ,af)be a walk. The starting point ofw is the tail 
of fli, the end point ofw is the head of ac. A segment of w is a 
consecutive subsequence of w. Two segments wi = (a,,... ,aj) 
and W 2 — (ai',... ,aj>) of the walk w are non-overlapping if 
j < i' or / < i. Note that two segments of w might be non¬ 
overlapping yet share arcs if w contains an arc several times. 
The distance distgCw, v) from vertex u to vertex v of G is the 
minimum cost of a walk in G starting in u and ending in v. 

The underlying undirected (multi)graph of G is obtained by 
replacing all directed arcs by undirected edges. Two vertices u, v 
of G are (weakly) connected if there is a walk starting in u and 
ending in v in the underlying undirected graph of G. A (weakly) 
connected component of G is a maximal subgraph of G in which 
all vertices are mutually (weakly) connected. 

For a multiset R c V xV of arcs, G[R] is the directed multi¬ 
graph consisting of the arcs in R and their incident vertices of G. 
We say that G[R] is the graph induced by the arcs in R. For a 
walk w = (fli,..., flf) in G, G[w] is the directed multigraph con¬ 
sisting of the arcs ai,... ,at and their incident vertices, where 
G[w] contains each arc with the multiplicity it occurs in w. Note 
that G[R] and G[w] might contain arcs with a higher multiplicity 
than G and, therefore, are not necessarily sub(multi)graphs of G. 
Finally, the cost of a multiset R is c(R) YiaeR v(a)c(a), where 
v(a) is the multiplicity of a in R. 

3 Rural Postman 

This section presents our approximation algorithms for DRPP 
and MWRPP, thus proving Theorem 1.7(i) and (ii). Section 3.1 
shows an algorithm for the special case of DRPP where the 
required arcs induce a subgraph with Eulerian connected com¬ 
ponents. Sections 3.2 and 3.3 subsequently generalize this al- 
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gorithm to DRPP and MWRPP by adding to the set of required 
arcs an arc set of minimum weight so that the required arcs 
induce a graph with Eulerian connected components. 

3.1 Special case: Required arcs induce Eulerian components 

To turn a(n)-approximations forn-vertex A-ATSP into (a(C)+l)- 
approximations for this special case of DRPP, we use Algo¬ 
rithm 3.1. The two main steps of the algorithm are illustrated 
in Figure 3.1; The algorithm first computes an Euler tour for 
each connected component of the graph G[R] induced by the 
set R of required arcs and then connects them using an approxi¬ 
mate A-ATSP tour on a vertex set Vr containing (at least) one 
vertex of each connected component of G[R]. 

The following Lemma 3.1 gives a bound on the cost of the so¬ 
lution returned by Algorithm 3.1. Algorithm 3.1 and Lemma 3.1 
are more general than necessary for this special case of DRPP. 
In particular, we will not exploit yet that they allow R to be a 
multiset and Vr to contain more than one vertex of each con¬ 
nected component of G[R]. This will become relevant in Sec¬ 
tion 3.2, when we use Algorithm 3.1 as a subprocedure to solve 
the general DRPP. 

Lemma 3.1. Let G be a directed graph with travel costs c, 
let R be a multiset of arcs of G such that G[R] consists of C 
Eulerian connected components, let Vr c y(G[R]) be a vertex 
set containing at least one vertex of each connected component 
of G[R], and let f be any closed walk containing the vertices Vr. 

If n-vertex A-ATSP is a(n)-approximable in f(n) time, then 
Algorithm 3.1 applied to (G,c,R) and Vr returns a closed walk 
of cost at most c(R) + a;(|VR|) ■ c{T) in f(|V«|) + 0(n^) time that 
traverses all arcs of R. 

Proof. We first show that the closed walk T returned by Algo¬ 
rithm 3.1 visits all arcs inR. Since the A-ATSP solution Ty^ con¬ 
structed in line 1 visits all vertices Vr, in particular vi,..., vc, 
so does the closed walk Tq constmcted in line 1. Thus, for each 
vertex v,-, I < i < C,T takes Euler tour Tj through the connected 
component i of G[R] and, thus, visits all arcs in R. 

We analyze the cost c{T). The closed walk T is composed of 
the Euler tours Ti computed in line 1 and the closed walk Tc 
computed in line 1. Hence, c{T) - c{Tq) + c(r,). Since 

each Ti is an Euler tour for some connected component i of G[R], 
each Ti visits each arc of component i as often as it is contained 
in R. Consequently, ciTf — c(R). 

It remains to analyze c(7’g)- Observe hrst that the distances in 
the A-ATSP instance {Vr,c') correspond to shortest paths in G 
and thus fulhll the triangle inequality. We have c(Tq) - cfTyf) 
by construction of the A-ATSP instance {Vr, c') in line 1 and 
by construction of Re from in line 1. Let R be any closed 
walk containing Vr and let Rj)^ be an optimal solution for the 
A-ATSP instance {Vr,c'). If we consider the closed walk fy^ 
that visits the vertices of the A-ATSP instance {Vr, c') in the 
same order as R, we get c'{TyJ < cfTyf) < c{T). Since the 
closed walk Ry^ computed in line 1 is an a(|V«|)-approximate 
solution to the A-ATSP instance {Vr,c'), it finally follows that 
c(Rc) = c'(RyJ < a(|y^|) ■ c'(R*^) < a(|y^|) ■ c(R). 

Regarding the running time, observe that the instance (y^, c') 
in line 1 can be constructed in G(n^) time using the Floyd- 
Warshall all-pair shortest path algorithm [20], which dominates 
all other steps of the algorithm except for, possibly, line 1 . □ 


Lemma 3.1 proves Theorem 1.7(i) for DRPP instances I - 
(G, c, R) when G[R] consists of Eulerian connected components: 
Pick Vr to contain exactly one vertex of each of the C connected 
components of G[R]. Since an optimal solution R* for I visits 
the vertices Vr and satishes c(R) < c(R*), Algorithm 3.1 yields 
a solution of cost at most c(R*) + a{C) ■ c{T*). 

3.2 Directed Rural Postman 

In the previous section, we proved Theorem 1 .7(i) for the spe¬ 
cial case of DRPP when G[R] consists of Eulerian connected 
components. We now transfer this result to the general DRPP. 
To this end, observe that a feasible solution R for a DRPP in¬ 
stance (G, c, R) enters each vertex v of G as often as it leaves. 
Thus, if we consider the multigraph G[R] that contains each arc 
of G with same multiplicity as R, then G[R] is a supermultigraph 
of G[R] in which every vertex is balanced [16, 39]: 

Definition 3.2 (Balance). We denote the balance of a vertex v 
in a graph G as 

balanceG(v) := indegp(y) - outdegQ(y). 

We call a vertex y balanced if balanceG(y) = 0. 

Since G[R] is a supergraph of G[R] in which all vertices are 
balanced and since a directed connected multigraph is Eulerian if 
and only if all its vertices are balanced, we immediately obtain 
the below observation. Herein and in the following, for two 
(multi-)sets X and F, W P is the multiset obtained by adding 
the multiplicities of each element in X and Y. 

Observation 3.3. Let R be a feasible solution for a DRPP in¬ 
stance (G, c, R) such that G[R] has C connected components and 
let R* be a minimum-cost multiset of arcs of G such that every 
vertex in G[R W R*] is balanced. Then, c(R W R*) < c(R) and 
G[R WR*] consists of at most C Eulerian connected components. 

Algorithm 3.2 computes an {a{C) + ^-approximation for a 
DRPP instance (G, c, R) by hrst computing a minimum-cost arc 
multiset R* such that G[R W R*] contains only balanced vertices 
and then applying Algorithm 3.1 to (G, c, R W R*). It is well 
known that the hrst step can be modeled using the Uncapacitated 
Minimum-Cost Flow Problem [11, 13, 16, 19, 22]: 

Problem 3.4 (Uncapacitated Minimum-Cost Flow (UMCF)). 
Instance: A directed graph G = {V, A) with supply s: V ^ 7. 
and costs c: A —> N U [0]. 

Task: Find a. flow f: A — > N U [0] minimizing Saeyi c(a)/(a) 
such that, for each y 6 y, 

Yj “ X ^ 

(v,w)eA (vy,v’)eA 

Equation (EC) is known as the ^ovy conservation constraint: 
For every vertex y with s(y) = 0, there are as many units of 
how entering the node as leaving it. Nodes y with s(y) > 0 
“produce” i(y) units of how, whereas nodes y with i(y) < 0 
“consume” i(y) units of how. For our purposes, we will use 
i(y) := balanceG[«](y). UMCF is solvable in 0{n^ logn) time [1, 
Theorem 10.34]. 

Lemma 3.5. Let I {G,c,R) be a DRPP instance such that 
G[R] has C connected components, and let y* be a vertex set 
containing exactly one vertex of each connected component 
of G[R]. Moreover, consider two closed walks in G; 
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(a) Input: Only required arcs R are shown, 
vertices in Vr are black. 



(b) Compute Euler tours T, (dashed) for (c) Add closed walk Tq with the vertices 

each connected component of G[I?]. in Vr to get a feasible solution T (dashed). 


Figure 3.1: Steps of Algorithm 3.1 to compute feasible solutions for DRPP when all connected components of G[R] are Eulerian. 


Algorithm 3.1: Algorithm for the proof of Lemma 3.1 

Input: A directed graph G with travel costs c, a multiset R of arcs of G such that G[R] consists of C Eulerian connected 
components, and a set Vr c y(G[R]) containing at least one vertex of each connected component of G[R]. 
Output: A closed walk traversing all arcs in R. 

1 for / = 1,..., C do 

2 V, <— any vertex of Vr in component i of G[R]; 

3 Ti <— Euler tour of connected component i of G[R] starting and ending in v,; 

4 {Vr,c') <— A-ATSP instance on the vertices Vr, where c'(y;, vj) distc(v/, v;); 

5 Tvr adERD-approximate A-ATSP solution for (VR,c'y, 

6 Tq ^ closed walk for G obtained by replacing each arc (v,, Vj) on Tvf. by a shortest path from v,- to vj in G; 

7 r «— closed walk obtained by following Tq and taking a detour E, whenever reaching a vertex v,; 

8 return T ; 


- Let t be any closed walk containing the vertices Vr, and 

- let T be any feasible solution for I. 

If n-vertex A-ATSP is a(«)-approximable in t{ri) time, then 
Algorithm 3.2 applied to I and Vr returns a feasible solution of 
cost at most c{f) + a(C) ■ c(f) in f(C) + 0(n^ log n) time. 

Proof. Eor the sake of self-containment, we first prove that Al¬ 
gorithm 3.2 in line 2 indeed computes a minimum-cost arc set R* 
such that all vertices in G[R W R*] are balanced. This follows 
from the one-to-one correspondence between arc multisets R' 
such that G[R W R'] has only balanced vertices and flows / 
for the UMCF instance /' := (G, balancecjg], c): Each vertex v 
has balanceG[R](v) more incident in-arcs than out-arcs in G[R] 
and, thus, in order for balanceG[fiiiifi'](v) = 0 to hold, R' has 
to contain balanceG[7?](v) more out-arcs than in-arcs incident 
to V. Likewise, by (EC), in any feasible flow for /', there are 
balanceG[fi](v) more units of flow leaving v than entering v. 

Thus, from a multiset R' of arcs such that G[R W R'] is bal¬ 
anced, we get a feasible flow / for /' by setting /(v, w) to the 
multiplicity of the arc (v, w) in R'. From a feasible flow / 
for /', we get a multiset R' of arcs such that G[R W /?'] is bal¬ 
anced by adding to R' each arc (v, w) with multiplicity f{v, w). 
We conclude that the arc multiset R* computed in line 2 is a 
minimum-cost set such that G[R W /?*] is balanced: A set of 
lower cost would yield a flow cheaper than the optimum flow / 
computed in line 2. 

We use the optimality of R* to give an upper bound on the 
cost of the closed walk T computed in line 2. Since Vr contains 
exactly one vertex of each connected component of G[/?], it con¬ 
tains at least one vertex of each connected component of G[R W 
R*]. Therefore, Algorithm 3.1 is applicable to {G,c,R W /?*) 


and, by Lemma 3.1, yields a closed walk in G traversing all arcs 
in W R* and having cost at most c(R ^R*) + adVRl) ■ c(T). This 
is a feasible solution for (G, c, R) and, since by Observation 3.3, 
we have c{R W /?*) < c{T), it follows that this feasible solution 
has cost at most c(f) + a(C) ■ c(f). 

Einally, the running time of Algorithm 3.2 follows from the 
fact that the minimum-cost flow in line 2 is computable in 
0(n^ log n) time [1, Theorem 10.34] and that Algorithm 3.1 
runs in f(C) -i- G(n^) time (Lemma 3.1). □ 

We may now prove Theorem 1.7(i). 

Proof of Theorem 1.7(i ). Let (G, c, R) be an instance of DRPP 
and let Vr be a set of vertices containing exactly one vertex of 
each connected component of G[R]. An optimal solution T* 
for I contains all arcs in R and all vertices in Vr and hence, by 
Lemma 3.5, Algorithm 3.2 computes a feasible solution T with 
c(r) < c(E*) H- a{C) ■ c(T*) for I. □ 

Before generalizing Algorithm 3.2 to MWRPP, we point out 
two design choices in the algorithm that allowed us to prove 
an approximation factor. Algorithm 3.2 has two steps: It first 
adds a minimum-weight set R* of required arcs so that G[R W 
R*] has Eulerian connected components. Then, these connected 
components are connected using a cycle via Algorithm 3.1. 

In the first step, it might be tempting to add a minimum- 
weight set R' of required arcs so that each connected component 
of G[R] becomes an Eulerian connected component of G[R WR']. 
However, this set R' might be more expensive than R*: Multiple 
non-Eulerian connected components of G[R] might be contained 
in one Eulerian connected component of G[R W R*]. 
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Algorithm 3.2: Algorithm for the proof of Lemma 3.5. 

Input: A DRPP instance I - (G, c, R) such that G[R] has C connected components and a set Vr of vertices, one of each 
connected component of G[R]. 

Output: A feasible solution for I. 

1 / «— minimum-cost flow for the UMCF instance (G,balanceG[R],c); 

2 foreach a e A(G) do add arc a with multiplicity f{a) to (initially empty) multiset R*; 

3 r <— closed walk computed by Algorithm 3.1 applied to (G, c,R^ R*) and V*; 

4 return T ; 


In the second step, it is crucial to connect the connected 
components of G[R W R*] using a cycle. Christofides et al. 
[11] and Corberan et al. [13], for example, reverse the two 
phases of the algorithm and first join the connected components 
of G[R] using a minimum-weight arborescence or spanning 
tree, respectively. This, however, may increase the imbalance 
of vertices and, thus, the weight of the arc set R* that has to 
be added in their second phase in order to balance the vertices 
of G[R^R*]. 

Interestingly, the heuristic of Corberan et al. [13] aims to 
find a minimum-weight connecting arc set so that the resulting 
graph can be balanced at low extra cost and already Pearn and 
Wu [36] pointed out that, in context of the (undirected) RPP, 
reversing the steps in the algorithm of Christofides et al. [11] 
can be beneficial. 

3.3 Mixed and windy Rural Postman 

In the previous section, we presented Algorithm 3.2 for DRPP in 
order to prove Theorem 1 .7(i). We now generalize it to MWRPP 
in order to prove Theorem 1.7(ii). 

To this end, we replace each undirected edge (m, v] in an 
MWRPP instance by two directed arcs (m, v) and (v, u), where 
we force the undirected required edges of the MWRPP instance 
to be traversed in the cheaper direction: 

Lemma 3.6. Let I {G,c,R) be an MWRPP instance and 
let /' := {G',c,R') be the DRPP instance obtained from I as 
follows: 

- G' is obtained by replacing each edge {u, v] of G by two 
arcs (m, v) and (v, u), 

- R' is obtained from R by replacing each edge (m, v] 6 R by 
an arc (m, v) if c(u, v) < c(v, u) and by (v, u) otherwise. 

Then, 

(i) each feasible solution T' for /' is a feasible solution of the 
same cost for I and, 

(ii) for each feasible solution T for /, there is a feasible solu¬ 
tion T' for r with c{T') < 3c(T). 

Proof. Statement (i) is obvious since each required edge of I 
is served by T' in at least one direction. Moreover, the cost 
functions in I and /' are the same. 

Towards (ii), let T be a feasible solution for I, that is, T is a 
closed walk that traverses all required arcs and edges of I. We 
show how to transform T into a feasible solution for /'. Let {u, v) 
be an arbitrary required arc of /' that is not traversed by T. Then, 
I contains a required edge (m, v] and T contains arc (v, u) of /'. 
Moreover, c{u, v) < c(v, u). Thus, we can replace (v, u) on T by 


the sequence of arcs (v, u), (u, y), (y, u). This sequence serves 
the required arc («, y) of /' and costs c(y, u) + c(u, y) + c(v, u) < 
3c(m, y). □ 

Using Lemma 3.6, it is easy to prove Theorem L7(ii). 

Proof of Theorem 1.7(ii). Given an MWRPP instance I - 
(G, c, R), compute a DRPP instance /' := (G',c, R') as described 
in Lemma 3.6. This can be done in linear time. 

Let Vr be a set of vertices containing exactly one vertex of 
each connected component of G'[R'] and let T* be an optimal 
solution for I. Observe that T* is not necessarily a feasible 
solution for /', since it might serve required arcs of /' in the 
wrong direction. Yet T* is a closed walk in G' visiting all 
vertices of Vr. Moreover, by Lemma 3.6, /' has a feasible 
solution T' with c{T') < 3c(T*). 

Thus, applying Algorithm 3.2 to /' and Vr yields a feasible 
solution T of cost at most c(T') + a(C} ■ c(T*) < 3c(T*) + a(C) ■ 
c{T*) due to Lemma 3.5. Finally, T is also a feasible solution 
for I by Lemma 3.6. □ 

Remark 3.7. If a required edge {u, y] has c(u, y) = c(y, u), then 
we replace it by two arcs (u, y) and (y, u) in the input graph G and 
replace (m, y] by an arbitrary one of them in the set R of required 
arcs without influencing the approximation factor. This gives a 
lot of room for experimenting with heuristics that “optimally” 
orient undirected required edges when converting MWRPP to 
DRPP [13, 34]. Indeed, we will do so in Section 5. 

4 Capacitated Arc Routing 

We now present our approximation algorithm for MWCARP, 
thus proving Theorem 1.7(iii). Our algorithm follows the “route 
first, cluster second”-approach [3, 23, 31, 40, 41] and exploits 
the fact that joining all vehicle tours of a solution gives an 
MWRPP tour traversing all positive-demand arcs and the depot. 
Thus, in order to approximate MWCARP, the idea is to first 
compute an approximate MWRPP tour and then split it into 
subtours, each of which can be served by a vehicle of capacity Q. 
Then we close each subtour by shortest paths via the depot. We 
now describe our approximation algorithm for MWCARP in 
detail. For convenience, we use the following notation. 

Definition 4.1 (Demand arc). For a mixed graph G = {V,A,E) 
with demand function d: £ U A ^ N U [0], we define 

R^ {a € E U A \ d(a) > 0] 

to be the set of demand arcs. 

We construct MWCARP solutions from what we call feasible 
splittings of MWRPP tours T. 
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Definition 4.2 (Feasible splitting). For an MWCARP in¬ 
stance I — (G, Vo, c, d, Q), let F be a closed walk containing 
all arcs in Rj and W - (wi,.. ., Wf) be a tuple of segments of T. 
In the following, we abuse notation and refer by W to both the 
tuple and the set of walks it contains. 

Consider a serving function s: VF —» 2*'' that assigns to each 
walk w the set s(w) of arcs in that it serves. We call (W, s) a 
feasible splitting ofT if the following conditions hold: 

(i) the walks in W are mutually non-overlapping segments 
of F, 

(ii) when concatenating the walks in W in order, we obtain a 
subsequence of F, 

(iii) each w, 6 W begins and ends with an arc in s(w,), 

(iv) {s(w,) I Wi 6 W) is a partition of Rd, and 

(v) for each w, e W, we have d{e) < Q and, if i < €, 

then Yjees(wi) d{e) + d(a) > Q, where a is the hrst arc served 
by Wi+i. 

Constructing feasible splittings. Given an MWCARP in¬ 
stance I - (G, Vo, c, d, Q), a feasible splitting (VF, s) of a closed 
walk F that traverses all arcs in Rj can be computed in linear 
time using the following greedy strategy. We assume that each 
arc has demand at most Q since otherwise I has no feasible so¬ 
lution. Now, traverse F, successively dehning subwalks w eW 
and the corresponding sets i(w) one at a time. The traversal 
starts with the first arc a e Rd of T and by creating a sub¬ 
walk w consisting only of a and i(w) = (a). On discovery of a 
still unserved arc a e Rd\ (Uw'ew do the following. If 

Zieej(w) d(e) + d(a) < Q, then add a to s(w) and append to w the 
subwalk of F that was traversed since discovery of the previous 
unserved arc in Rd- Otherwise, mark w and s(w) as finished, 
start a new tour w 6 VF with a as the hrst arc, set s(w) = (a), and 
continue the traversal of F. If no such arc a is found, then stop. 
It is not hard to verify that (VF, s) is indeed a feasible splitting. 

The algorithm. Algorithm 4.1 constructs an MWCARP solu¬ 
tion from an approximate MWRPP solution F containing all 
demand arcs and the depot vq. In order to ensure that F con¬ 
tains Vo, Algorithm 4.1 assumes that the input graph has a de¬ 
mand loop (vo, Vo): If this loop is not present, we can add it 
with zero cost. Note that, while this does not change the cost of 
an optimal solution, it might increase the number of connected 
components in the subgraph induced by demand arcs by one. 
To compute an MWCARP solution from F, Algorithm 4.1 hrst 
computes a feasible splitting (VF, s) of F. To each walk w, e VF, 
it then adds a shortest path from the end of w, to the start of w,- 
via the depot. It is not hard to check that Algorithm 4.1 indeed 
outputs a feasible solution by using the properties of feasible 
splittings and the fact that F contains all demand arcs. 

Remark 4.3. Instead of computing a feasible splitting of F 
greedily. Algorithm 4.1 could compute a splitting of F into 
pairwise non-overlapping segments that provably minimizes the 
cost of the resulting MWCARP solution [4, 31, 40, 41]. Indeed, 
we will do so in our experiments in Section 5. For the analysis 
of the approximation factor, however, the greedy splitting is 
sufhcient and more handy, since the analysis can exploit that 


two consecutive segments of a feasible splitting serve more than 
Q units of demand (excluding, possibly, the last segment). 

The remainder of this section is devoted to the analysis of the 
solution cost, thus proving the following proposition, which, 
together with Theorem 1.7(ii), yields Theorem 1.7(iii). 

Proposition 4.4. Let I - (G, vq, c, d, Q) be an MWCARP in¬ 
stance and let /' be the instance obtained from I by adding a 
zero-cost demand arc (vq, vq) if it is not present. 

If MWRPP is /3(C)-approximable in f(n) time, then Algo¬ 
rithm 4.1 applied to /' computes a (8yS(C -i- 1) -t 3)-approxima- 
tion for I in f(C -i- 1) -t G(n^) time. Flerein, C is the number of 
connected components in G{Rd}. 

The following lemma follows from the fact that the concatena¬ 
tion of all vehicle tours in any MWCARP solution yields an 
MWRPP tour containing all demand arcs and the depot. 

Lemma 4.5. Let I - (G, vq, c, d, Q) be an MWCARP instance 
with (vq, Vq) 6 Rd and an optimal solution (VF*, s*). The closed 
walk F and its feasible splitting (VF, s) computed in lines 3 and 3 
of Algorithm 4.1 satisfy c(VF) < c(F) < y8(C)c(VF*), where C is 
the number of connected components in G{Rd}. 

Proof. Consider an optimal solution (VF*, s*) to I. The closed 
walks in VF* visit all arcs in Rd. Concatenating them to a 
closed walk F* gives a feasible solution for the MWRPP in¬ 
stance r - {G,c,Rd) in line 3 of Algorithm 4.1. Moreover, 
c(F*) = c(VF*). Thus, we have c(F) < f(C)c(T*) in line 3. More¬ 
over, by Definition 4.2(i), one has c(VF) < c(F). This finally 
implies c(VF) < c(F) < /3(C)c(F*) = f(C)c(W*) in line 3. □ 

For each w,- e VF, it remains to analyze the length of the shortest 
paths from vq to w, and from w, to vq added in line 3 of Algo¬ 
rithm 4.1. We bound their lengths in the lengths of an auxiliary 
walk A(wi) from vq to w, and of an auxiliary walk Z(w,) from w, 
to Vq. The auxiliary walks A(w;) and Z(w,) consist of arcs of VF, 
whose total cost is bounded by Lemma 4.5, and of arcs of an op¬ 
timal solution (VF*, s*). We show that, in total, the walks A(w,) 
and Z(w,) for all w,- e VF use each subwalk of VF and VF* at most 
a constant number of times. To this end, we group the walks 
in VF into consecutive pairs, for each of which we will be able to 
charge the cost of the auxiliary walks to a distinct vehicle tour 
of the optimal solution. 

Definition 4.6 (Consecutive pairing). For a feasible split¬ 
ting (VF, s) with VF = (wi,..., W(), we call 

W^:^{(w2i-uw2i)\ie{l,...,[£/2i}} 

a consecutive pairing. 

We can now show, by applying FlaU’s theorem [29], that each 
pair traverses an arc from a distinct tour of an optimal solution. 

Lemma 4.7. Let I — (G, vq, c, d, Q) be an MWCARP instance 
with an optimal solution (VF*, s*) and let VF^ be a consecu¬ 
tive pairing of some feasible splitting (VF, s). Then, there is 
an injective map f: VF^ ^ VF*, (w,, w,+i) i-> w* such that 
(s(w,) U s(W;+i)) n S*(W*) ^ 0. 

Proof. Define an undirected bipartite graph B with the partite 
sets VV^^ and VF*. A pair (w;, w/+i) 6 VF^ and a closed walk w* e 
VF* are adjacent in B if (i(w,) U i(w,+i)) n i*(w*) 4^ 0. We prove 
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Algorithm 4.1: Algorithm for the proof of Proposition 4.4. 

Input: An MWCARP instance I - (G, vo, c, d, Q) such that (vo, vo) e Rd and such that G{Rd} has C connected components. 
Output: A feasible solution for I. 

/* Compute a base tour containing all demand arcs and the depot */ 

1 /' <— MWRPP instance (G, c, Rd)', 

2 r <— j8(C)-approximate MWRPP tour for /' starting and ending in vq; 

/* Split the base tour into one tour for each vehicle */ 

3 iW, i) <— a feasible splitting of T ; 

4 foreach w e W do 

5 1^ close w by adding shortest paths from vq to s and from f to vq in G, where s, t are the start and endpoints of w, respectively; 

6 return {W, s); 


A'iwi) Z'iwi) 



Figure 4.1: Illustration of Definition 4.8. Dotted lines are ancil¬ 
lary lines. Thin arrows are walks. The braces along the bottom 
show a consecutive pairing of walks w,_i,..., w,+ 2 - Bold arcs 
are pivot arcs. Here, p{i) is exactly the pair that contains w,- 
and q{i) is the next pair. 

that B allows for a matching that matches each vertex of W'^ 
to some vertex in W*. To this end, by Hall’s theorem [29], it 
suffices to prove that, for each subset S c it holds that 
|Ab( 5)| > |5|, where Ab(5) := Ab(v) and Ab(v) is the set 
of neighbors of a vertex v in B. Observe that, by Definition 4.2(v) 
of feasible splittings, for each pair (w,, w/+i) e W'^, we have 
d{s{wi) U i(w/+i)) > Q. Since the pairs serve pairwise disjoint 
sets of demand arcs by Definition 4.2(iv), the pairs in S serve a 
total demand of at least Q ■ | in the closed walks Nb{S) c W*. 

Since each closed walk in NsiS ) serves demand at most Q, the 
set NsiS ) is at least as large as 5, as required. □ 

In the following, we fix an arbitrary arc in (s(w,) U s(w;+i)) n 
s*(w*) for each pair (w,, w,+i) e and call it the pivot arc 
of (w/, w,+i). Informally, the auxiliary walks A(w,), Z(w,) men¬ 
tioned before are constructed as follows for each walk w,. To 
get from the endpoint of w,- to vq, walk along the closed walk T 
until traversing the first pivot arc a, then from the head of a to vq 
follow the tour of W* containing a. To get from vq to w,, take the 
symmetric approach: walk backwards on T from the start point 
of Wi until traversing a pivot arc a and then follow the tour of W* 
containing a. The formal definition of the auxiliary walks A{w) 
and Z(w) is given below and illustrated in Figure 4.1. 

Definition 4.8 (Auxiliary walks). Let I - (G, vq, c, d, Q) be an 
MWCARP instance, {W*, s*) be an optimal solution, and be 
a consecutive pairing of some feasible splitting {W, s) of a closed 
walk T containing all arcs Rd and vq, where W - iw\, ..., W(). 

Let (j): W'^ W* be an injective map as in Lemma 4.7 and, 
for each pair (w;, w,+i) e W^, let 


A*(w,, w,+i) be a subwalk of 0(w,, Wi+i) from vq to the tail of 
the pivot arc of (w,, w,+i), 

Z*(w,, w,+i) be a subwalk of 0(w,, w,+i) from the head of the 
pivot arc of (w;, w,+i) to vq. 

For each walk w, e W with i > 3 (that is, w,- is not in the first 
pair of W^), let 

p(i) be the index of the pair whose pivot arc is traversed first 
when walking T backwards starting from the starting 
point of Wi, 

A'(wi) be the subwalk of T starting at the end point of 
A*{w 2 p(i)-\,W 2 p{i)) and ending at the start point of w,, 
and 

A(w;) be the walk from vq to the start point of w, following 
first A*(w 2 p(i)-i,W 2 p(i)) and then A'(w,). 

For each walk w; e W with i < { -3 (that is, Wj is not in the last 
pair of W^, where wt might not be in any pair if t is odd), let 

q{i) be the index of the pair whose pivot arc is traversed first 
when following T starting from the end point of w,-, 

Z'(w;) be the subwalk of T starting at the end point of w, and 
ending at the start point of Z*(w 2 ^(,)-i, W 2 ^(,)), and 

Z(w;) be the walk from the end point of w, to vq following 
first Z'(w,) and then Z*(w2^(o-i, W2^(o)- 

We are now ready to prove Proposition 4.4, which also concludes 
our proof of Theorem 1.7. 

Proof of Proposition 4.4. Let / = (G, vq, c, d, Q) be an MWRPP 
instance and (W*,i*) be an optimal solution. If there is no 
demand arc (vq, vq) in I, then we add it with zero cost in order 
to make Algorithm 4.1 applicable. This clearly does not change 
the cost of an optimal solution but may increase the number of 
connected components of G{Rd} to C -H 1. 

In lines 3 and 3, Algorithm 4.1 computes a tour T and its 
feasible splitting (VT, s), which works in f(C -i- 1) -i- G(n^) time 
by Theorem L7(ii). Denote W — (wi,... ,we). The solution 
returned by Algorithm 4.1 consists, for each 1 < i < ^, of a 
tour starting in vq, following a shortest path to the starting point 
of Wj, then w,, and a shortest path back to vq. 

For i > 3, the shortest path from vq to the starting point of w, 
has length at most c(A(w;)). For i <{ - 3, the shortest path 
from the end point of w, to vo has length at most c(Z(w,)). This 
amounts to c(A(w,)) + Yfifl c(Z(w,)). To bound the costs 
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A'(w,+2) 



Figure 4.2: Illustration of the situation in which a maximum number of five different walks in W traverse the same pivot arc (the 
bold arc of w,) in their respective auxiliary walks. 


of the shortest paths attached to w, for i e {l,2,£ - 2,{ - 
observe the following. For each i e (1,2), the shortest paths 
from vq to the start point of w, and from the end point of 
to Vo together have length at most c{T). The shortest path from 
the end point of W( to vq has length at most c{T) - c(W). Thus, 
the solution returned by Algorithm 4.1 has cost at most 

r { f-3 

^ c(wi) + c(A(wi)) + Y + ^cCT) - c(W) 

i=l (=3 1=1 

e e-3 

= 2 c(A(w,)) + 2 c(Z(w,)) + 3c{T) 

1=3 1=1 

= 3c(T) + 

f f-3 

+ Yj ^ c(Z*(W2^(,)-l, W2g(i))) + 

;=3 !=I 

(SI) 

e r-3 

+ 2c(A'(w,)) + 2c(Z'(w,)). (S2) 

1=3 1=1 

Observe that, for a fixed /, one has p{i) - p(j) only for j < i + 2 
and q{i) - q{j) only for j > i - 2. Moreover, by Lemma 4.7 
and Definition 4.8, if p{i) + pij), then A*{w 2 p(i)-\,wip(i)) and 
A*{w 2 p(iy\,W 2 pU)) are subwalks of distinct walks of W*. Simi¬ 
larly, Z*{w 2 qp)-i,W 2 q(i)) and Z*{w 2 q(j)-\,W 2 qU)) are subwalks of 
distinct walks of W* if q{i) + q(j). Hence, sum (SI) counts each 
arc of W* at most three times and is therefore bounded from 
above by 3c(lT*). 

Now, for a walk w,, let jTf, be the set of walks Wj such that 
any arc a of w, is contained in A'{wj) and let Z, be the set of 
walks such that any arc a of w, is contained in Z'(w^). Observe 
that A'{wj) and Z'{wj) cannot completely contain two walks 
of the same pair of the consecutive pairing of W since, by 
Lemma 4.7, each pair has a pivot arc and A'{wj) and Z'(w;) both 
stop after traversing a pivot arc. Hence, the walks in ^iUXi can 
be from at most three pairs of the pair containing w; and 
the two neighboring pairs. Finally, observe that w, itself is not 
contained in U Zi- Thus, jTf,- U Xi contains at most five walks 
(Figure 4.2 shows a worst-case example). Therefore, sum (S2) 
counts every arc of W at most five times and is bounded from 
above by 5c(W). 


Thus, Algorithm 4.1 returns a solution of cost 3c(7’) + 5c(lT) -i- 
3c(lT*) which, by Lemma 4.5, is at most 8c(7’) + 3c(W*) < 
me + l)c(lT*) H- 3c(VT*) < (8/3(C -H 1) H- 3)c(lT*). □ 

5 Experiments 

Our approximation algorithm for MWCARP is one of many 
“route first, cluster second”-approaches, which was first applied 
to CARP by Ulusoy [40] and led to constant-factor approxima¬ 
tions for the undirected CARP [31,41]. Notably, Belenguer et al. 
[4] implemented Ulusoy’s heuristic [40] for the mixed CARP 
by computing the base tour using path scanning heuristics. Our 
experimental evaluation will show that Ulusoy’s heuristic can 
be substantially improved by computing the base tour using our 
Theorem 1.7(ii). 

For the evaluation, we use the mval and Ipr benchmark sets 
of Belenguer et al. [4] for the mixed (but non-windy) CARP and 
the egl-large benchmark set of Brandao and Eglese [10] for 
the (undirected) CARP. We chose these benchmark sets because 
relatively good lower bounds to compare with are known [9, 27]. 
Moreover, the egl-large set is of particular interest since it 
contains large instances derived from real road networks and 
the mval and Ipr sets are of particular interest since Belenguer 
et al. [4] used them to evaluate their variant of Ulusoy’s heuristic 
[40], which is very similar to our algorithm. 

In the following. Section 5.1 describes some heuristic en¬ 
hancements of our algorithm. Section 5.2 interprets our experi¬ 
mental results, and Section 5.3 describes an approach to trans¬ 
form instances of existing benchmark sets into instances whose 
positive-demand arcs induce a moderate number of connected 
components. 

5.7 Implementation details 

Since our main goal is evaluating the solution quality rather than 
the running time of our algorithm, we sacrificed speed for sim¬ 
plicity and implemented it in Python.' Thus, the running time 
of our implementation is not competitive to the implementations 
by Belenguer et al. [4] and Brandao and Eglese [10].^ However, 

'Source code available at http://gitlab.com/rvb/mwcarp-approx 
^We do not provide running time measurements since we processed many 
instances in parallel, which does not yield reliable measurements. 
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it is clear that a careful implementation of our algorithm in C++ 
will yield competitive running times; The most expensive steps 
of our algorithm are the Floyd-Warshall all-pair shortest path 
algorithm [20], which is also used by Belenguer et al. [4] and 
Brandao and Eglese [10], and the computation of an uncapaci¬ 
tated minimum-cost flow, algorithms for which are contained in 
highly optimized C-H-i- libraries like LEMON. ^ 

In the following, we describe heuristic improvements over 
the algorithms presented in Sections 3 and 4, which were de¬ 
scribed there so as to conveniently prove upper bounds rather 
than focusing on good solutions. 

5.1.1 Joining connected components 

We observed that, in all but one instance of the egl-large, 
Ipr, and mval benchmark sets, the set of positive-demand arcs 
induce only one connected component. Therefore, connecting 
them is usually not necessary and the call to Algorithm 3.1 in 
Algorithm 3.2 can be skipped completely. If not, then, contrary 
to the description of Algorithm 3.1, we do not arbitrarily select 
one vertex from each connected component and join them using 
an approximate A-ATSP tour as in Algorithm 3.1 or using an 
optimal A-ATSP tour as for Corollary 1.8. 

Instead, using bmte force, we try all possibilities of choosing 
one vertex from each connected component and connecting them 
using a cycle and choose the cheapest variant. If the positive- 
demand arcs induce C connected components, then this takes 
0{n^ ■ Cl + n^) time in an n-vertex graph. That is, for C < 3, 
implementing Algorithm 3.1 in this way does not increase its 
asymptotic time complexity. 

5.1.2 Choosing service direction 

The instances in the egl-large, Ipr, and mval benchmark sets 
are not windy. Thus, as pointed out in Remark 3.7, when com¬ 
puting the MWRPP base tour, we are free to choose whether to 
replace a required undirected edge [m, v) by a required arc (m, v) 
or a required arc (v, u) (and adding the opposite non-required arc) 
without increasing the approximation factor in Theorem 1.7(ii). 

We thus implemented several heuristics for choosing what we 
call the service direction of the undirected edge {u, v). Some of 
these heuristics choose the service direction independently for 
each undirected edge, similarly to Corberan et al. [13], others 
choose it for whole undirected paths and cycles, similarly to 
Mourao and Amado [34]. 

We now describe these heuristics in detail. To this end, let 
G denote our input graph and R be the set of required arcs. 

EO(R) assigns one of the two possible service directions to 
each undirected edge uniformly at random. 

EO(P) replaces each undirected edge {u, v) e R by an 
arc (u,v) e R if balanceG[R](v) < balanceG[R](M), by 
an arc (v, u) e R if balanceG[R](v) > balancec[fi](M), and 
chooses a random service direction otherwise. 

EO(S) randomly chooses one endpoint v of each undirected 
edge (m, v} e R and replaces it by an arc (m, v) e R if 
balanceG[R](v) < 0 and by (v, u) e R otherwise. 

^http://lemon.cs.elte. hu/ 


Herein, “EO” is for “edge orientation”. The “R” in parentheses 
is for “random”, the “P” for “pair” (since it levels the balances 
of pairs of vertices), and the “S” is for “single” (since it mini¬ 
mizes I balance(v)| of a single random endpoint v of the edge). 

In addition, we experiment with three heuristics that do not 
orient independent edges but long undirected paths. Herein, the 
aim is that a vehicle will be able to serve all arcs resulting from 
such a path in one run. 

Eirst, the heuristics repeatedly search for undirected cycles 
in G[R] and replace them by directed cycles in R. When no 
undirected cycle is left, then the undirected edges of G[R] form 
a forest. The heuristics then repeatedly search for a longest undi¬ 
rected path in G[R] and choose its service direction as follows. 

PO(R) assigns the service direction randomly. 

PO(P) assigns the service direction by leveling the balance of 
the endpoints of the path, analogously to EO(P). 

PO(S) assigns the service direction so as to minimize 
I balance(v)| for a random endpoint v of the path, analo¬ 
gously to EO(S). 

Generally, we observed that these heuristics first find three or 
four long paths with lengths from 5 up to 15. Then, the length 
of the found paths quickly decreases: In most instances, at least 
half of all found paths have length one, at least 3/4 of all found 
paths have length at most two. 

We now present experimental results for each of these six 
heuristics. 

5.1.3 Tour splitting 

As pointed out in Remark 4.3, the MWRPP base tour initially 
computed in Algorithm 4.1 can be split into pairwise non¬ 
overlapping subsequences so as to minimize the total cost of the 
resulting vehicle tours. To this end, we apply an approach of 
Beasley [3] and Ulusoy [40], which by now can be considered 
folklore [4, 31,41] and works as follows. 

Denote the positive-demand arcs on the MWRPP base tour 
as a sequence ai,... ,at. To compute the optimal splitting, we 
create an auxiliary graph with the vertices \,... ,€+ 1. Between 
each pair (/, j) of vertices, there is an edge whose weight is the 
cost for serving all arcs a,-, a,+i,..., ap\ in this order using one 
vehicle. That is, its cost is oo if the demands of the arcs in this 
segment exceed the vehicle capacity Q and otherwise it is the 
cost for going from the depot vq to the tail of a,, serving arcs a, 
to and returning from the head of aj-i to the depot v. Then, 
a shortest path from vertex 1 to f -H 1 in this auxiliary graph 
gives an optimal splitting of the MWRPP base tour into mutually 
non-overlapping subsequences. 

Additionally, we implemented a trick of Belenguer et al. 
[4] that takes into account that a vehicle may serve a seg¬ 
ment a ,-,... ,ak, ak+\ ..., flj-i by going to the tail of a^+i, serving 
arcs Qk+i to aj-\, going from the head of to the tail of a,, 
serving arcs a, to and finally returning from the head of Uk 
to the depot vq. Our implementation tries all such k and assigns 
the cheapest resulting cost to the edge between the pair (i, j) of 
vertices in the auxiliary graph. 

Of course one could compute the optimal order for serving 
the arcs of a segment a,,..., aj-i from the depot vq, but this 
would again be the NP-hard DRPP. 
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5.2 Experimental results 

Our experimental results for the Ipr, mval, and egl-large 
instances are presented in Tables 5.1 and 5.2. We grouped 
the results for the Ipr and mval instances into one table and 
subsection since our conclusions about them are very similar. 
We explain and interpret the tables in the following. 

5.2.7 Results for the Ipr and mval instances 

Table 5.1 presents known results and our results for the Ipr and 
mval instances. Each column for our results was obtained by 
mnning our algorithm with the corresponding service direction 
heuristic described in Section 5.1.2 on each instance 20 times 
and reporting the best result. The number 20 has been chosen 
so that our results are comparable with those of Belenguer et al. 
[4], who used the same number of runs for their path scanning 
heuristic (column PSRC) and their “route hrst, cluster second” 
heuristic (column lURL), which computes the base tour using 
a path scanning heuristic and then splits it using all tricks de¬ 
scribed in Section 5.1.3. Columns LB and UB report the best 
lower and upper bounds computed by Belenguer et al. [4] and 
Gouveia et al. [27] (usually not using polynomial-time algo¬ 
rithms). Finally, column IM shows the result that Belenguer 
et al. [4] obtained using an improved variant of the “augment 
and merge” heuristic due to Golden and Wong [25]. 

Table 5.1 shows that our algorithm with the EO(S) service 
direction heuristic solved three instances optimally, which other 
polynomial-time heuristics did not. The EO(P) heuristic solved 
one instance optimally, which also other polynomial-time heuris¬ 
tics did not. Moreover, whenever no variant of our algorithm 
finds the best result, then some variant yields the second best. 
It is outperformed only by IM in 26 out of 49 instances and 
by lURL in only one instance. Apparently, our algorithm out¬ 
performs PSRC and lURL. Notably, lURL differs from our 
algorithm only in computing the base tour heuristically instead 
of using our Theorem 1.7(ii). Thus, “route hrst, cluster second” 
heuristics seem to beneht from computing the base tour using 
our MWRPP approximation algorithm. 

Remarkably, when our algorithm yields the best result using 
one of the service direction heuristics described in Section 5.1.2, 
then usually other service direction heuristics also hnd the best 
or at least the second best solution. Thus, the choice of the 
service direction heuristic does not play a strong role. Indeed, 
we also experimented with repeating our algorithm 20 times on 
each instance, each time choosing the service direction heuristic 
randomly. The results come close to choosing the best heuristic 
for each instance. 

5.2.2 Results for the egl-large instances 

Table 5.2 reports known results and our results for the 
egl-large benchmark set. Again, each column for our results 
was obtained by running our algorithm with the corresponding 
service direction heuristic described in Section 5.1.2 on each 
instance 20 times. The column LB reports lower bounds by 
Bode and Irnich [9], the column UB shows the upper bound that 
Brandao and Eglese [10] obtained using their tabu-search algo¬ 
rithm (which generally does not run in polynomial time). The 
column PS shows the cost of the initial solution that Brandao 
and Eglese [10] computed for their tabu-search algorithm using 


a path scanning heuristic. Brandao and Eglese [10] implemented 
several polynomial-time heuristics for computing these initial 
solution. Among them, “route hrst, cluster second” approaches 
and “augment and merge” heuristics. In their work, path scan¬ 
ning yielded the best initial solutions. In Table 5.2, we see that 
our algorithm clearly outperforms it. Moreover, we see that es¬ 
pecially our PO service direction heuristics are successful. This 
is because the egl-large instances are undirected and, thus, 
contain many cycles consisting of undirected positive-demand 
arcs that can be directed by our PO heuristics without increasing 
the imbalance of vertices. 

5.3 The Ob benchmark set 

Given our theoretical work in Sections 3 and 4, the solution 
quality achievable in polynomial time appears to mainly depend 
on the number C on connected components in the graph induced 
by the positive-demand arcs. However, we noticed that widely 
used benchmark instances for variants of CARP have C = 1. In 
order to motivate a more representative evaluation of the quality 
of polynomial-time heuristics for variants of CARP, we provide 
the Ob set of instances derived from the Ipr and egl-large 
instances with C from 2 to 5. The approach can be easily used 
to create more components. 

The Ob instances^ simulate cities that are divided by a river 
that can be crossed via a few bridges without demand. The 
underlying assumption is that, for example, household waste 
does not have to be collected from bridges. We generated the 
instances as follows. 

As a base, we took sufficiently large instances from the Ipr 
and egl-large sets (it made little sense to split the small mval 
or Ipr instances into several components). In each instance, we 
chose one or two random edges or arcs as “bridges”. Let B be the 
set of their end points. We then grouped all vertices of the graph 
into clusters: For each v e B, there is one cluster containing 
all vertices that are closer to v than to all other vertices of B. 
Finally, we deleted all but a few edges between the clusters, so 
that usually two or three edges remain between each pair of 
clusters. The demand of the edges remaining between clusters is 
set to zero, they are our “bridges” between the river banks. The 
intuition is that, if one of our initially chosen edges or arcs (m, v) 
was a bridge across a relatively straight river, then indeed every 
point on us side of the river would be closer to u than to v. 
We discarded and regenerated instances that were not strongly 
connected or had river sides of highly imbalanced size (three 
times below the average component size). Figure 5.1 shows 
three of the resulting instances. 

Note that this approach can yield instances where C exceeds 
the number of clusters since deleting edges between the clusters 
may create more connected components in the graph induced 
by the positive-demand arcs. The approach straightforwardly 
applies to generating instances with even larger C: One simply 
chooses more initial “bridges”. 

As a starting point. Table 5.3 shows the number C, a lower 
bound (LB) computed using an ILP relaxation of Gouveia et al. 
[27], and the best upper bound obtained using our approximation 
algorithm for each of the Ob instances using any of the service 
direction heuristics in Section 5.1.2. The “ob-” instances were 

Available at http://gitlab.com/rvb/mwcarp-ob and named after the 

river Ob, which bisects the city Novosibirsk. 
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Table 5.1: Known results [4, 27] and our results for the Ipr and mval instances. See Section 5.2.1 for a description of the table. 
The best polynomial-time computed upper bound is written in boldface, the second best is underlined, names of instances solved 
optimally by our algorithms are also written in boldface. 


Instance 

LB 

Known results 

UB PSRC IM 

lURL 

EO(R) 

EO(P) 

Our results 
EO(S) PO(R) 

PO(P) 

PO(S) 

lpr-a-01 

13 484 

13484 

13 600 

13 597 

13 537 

13484 

13484 

13484 

13484 

13484 

13484 

lpr-a-02 

28 052 

28 052 

29 094 

28 377 

28 586 

28225 

28 381 

28 356 

28 239 

28 381 

28 356 

lpr-a-03 

76115 

76155 

79 083 

77 331 

78 151 

77 019 

76783 

76 964 

76951 

76783 

76 820 

lpr-a-04 

126946 

127 352 

133 055 

128566 

131884 

130470 

130137 

130255 

130198 

130171 

130186 

lpr-a-05 

202736 

205499 

215153 

207597 

212167 

210328 

209 980 

210265 

210235 

210139 

210 344 

lpr-b-01 

14 835 

14835 

15 047 

14918 

14 868 

14869 

14 869 

14835 

14835 

14835 

14835 

lpr-b-02 

28 654 

28654 

29 522 

29 285 

28 947 

28749 

28689 

28689 

28 757 

28790 

28 727 

lpr-b-03 

77 859 

77 878 

80017 

80591 

79 910 

78428 

78 745 

78 853 

78 645 

78810 

78 743 

lpr-b-04 

126 932 

127454 

133 954 

129449 

132241 

130024 

130024 

130024 

130076 

130024 

130024 

lpr-b-05 

209 791 

211771 

223 473 

215 883 

219 702 

217024 

216 769 

216459 

217 079 

216639 

216 659 

lpr-c-01 

18 639 

18639 

18 897 

18 744 

18 706 

18943 

18695 

18 732 

18708 

18752 

18 752 

lpr-c-02 

36 339 

36 339 

36 929 

36485 

36 763 

37177 

36 649 

36 856 

36723 

36711 

36 662 

lpr-c-03 

111 117 

111632 

115 763 

112462 

114539 

115 399 

114438 

114888 

114 336 

114335 

114290 

lpr-c-04 

168 441 

169254 

174416 

171823 

173 161 

174088 

172089 

172 902 

172637 

172172 

172 365 

lpr-c-05 

257 890 

259937 

268 368 

262089 

266 058 

266637 

263 989 

264947 

264911 

264263 

264665 

Instance 

LB 

UB 

PSRC 

IM 

lURL 

EO(R) 

EO(P) 

EO(S) 

PO(R) 

PO(P) 

PO(S) 

mvallA 

230 

230 

243 

243 

231 

245 

230 

238 

234 

239 

234 

mval IB 

261 

261 

314 

276 

292 

298 

285 

285 

307 

307 

307 

mval 1C 

309 

315 

427 

352 

357 

367 

362 

362 

367 

372 

370 

mval2A 

324 

324 

409 

360 

374 

397 

353 

324 

369 

369 

368 

mval2B 

395 

395 

471 

407 

434 

431 

424 

424 

424 

424 

424 

mval2C 

521 

526 

644 

560 

601 

621 

622 

592 

600 

624 

594 

mval3A 

115 

115 

133 

119 

128 

131 

129 

125 

122 

121 

121 

mval3B 

142 

142 

162 

163 

150 

151 

148 

151 

149 

147 

147 

mval3C 

166 

166 

191 

174 

192 

194 

190 

189 

194 

200 

200 

mval4A 

580 

580 

699 

653 

684 

648 

622 

645 

651 

647 

647 

mval4B 

650 

650 

775 

693 

737 

709 

687 

709 

690 

674 

682 

mval4C 

630 

630 

828 

702 

740 

750 

721 

736 

714 

722 

722 

mval4D 

746 

770 

1015 

810 

905 

875 

871 

852 

872 

879 

870 

mval5A 

597 

597 

733 

686 

683 

672 

619 

652 

614 

649 

644 

mval5B 

613 

613 

718 

677 

677 

687 

662 

685 

653 

653 

654 

mval5C 

697 

697 

809 

743 

811 

788 

773 

778 

783 

804 

783 

mval5D 

719 

739 

883 

821 

855 

859 

840 

854 

845 

840 

836 

mval6A 

326 

326 

392 

370 

367 

348 

347 

348 

344 

351 

350 

mval6B 

317 

317 

406 

346 

354 

345 

331 

354 

351 

343 

347 

mvalbC 

365 

371 

526 

402 

444 

455 

435 

435 

461 

454 

454 

mval7A 

364 

364 

439 

381 

390 

428 

386 

411 

404 

398 

398 

mval7B 

412 

412 

507 

470 

491 

474 

435 

463 

460 

460 

454 

mval7C 

424 

426 

578 

451 

504 

507 

474 

483 

489 

482 

482 

mvalSA 

581 

581 

666 

639 

651 

648 

635 

635 

639 

627 

641 

mvalSB 

531 

531 

619 

568 

611 

616 

582 

592 

596 

598 

600 

mvalSC 

617 

638 

842 

718 

762 

799 

737 

729 

776 

764 

779 

mval9A 

458 

458 

529 

500 

514 

503 

486 

493 

496 

490 

498 

mval9B 

453 

453 

552 

534 

502 

518 

504 

503 

503 

523 

506 

mval9C 

428 

429 

529 

479 

498 

509 

468 

488 

485 

479 

474 

mval9D 

514 

520 

695 

575 

622 

627 

603 

610 

612 

613 

608 

mvallOA 

634 

634 

735 

710 

705 

669 

663 

661 

667 

658 

659 

mval 1 OB 

661 

661 

753 

717 

714 

708 

687 

693 

703 

703 

698 

mvallOC 

623 

623 

751 

680 

714 

709 

689 

697 

698 

695 

687 

mvallOD 

643 

649 

847 

706 

760 

778 

739 

763 

775 

743 

722 
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Table 5.2: Known results [9, 10] and our results for the egl-large Instances. See Section 5.2.2 for a description of the table. The 


best polynomial-time computed upper bound is written in boldface. 


Instance 

LB 

Known results 

UB 

PS 

EO(R) 

EO(P) 

Our results 

EO(S) PO(R) 

PO(P) 

PO(S) 

egl-gl-A 

976 907 

1049708 

1318 092 

1258 206 

1181928 

1 209108 

1 153 029 

1 158 233 

1141457 

egl-gl-B 

1093 884 

1140692 

1483179 

1 367 979 

1306521 

1 328 250 

1 293 095 

1 308 350 

1297606 

egl-gl-C 

1212151 

1282270 

1584177 

1523 183 

1456 305 

1463 009 

1432 281 

1424722 

1430 841 

egl-gl-D 

1341918 

1420126 

1 744 159 

1684343 

1609 822 

1609537 

1586 294 

1601588 

1580634 

egl-gl-E 

1482176 

1583 133 

1 841023 

1829244 

1769977 

1780089 

1716612 

1748308 

1 755 700 

egl-g2-A 

1069 536 

1 129229 

1416720 

1 372 177 

1276 871 

1304618 

1 263 263 

1249293 

1 255 120 

egl-g2-B 

1 185 221 

1255 907 

1559464 

1517245 

1410385 

1449553 

1 398 162 

1405 916 

1404533 

egl-g2-C 

1311339 

1418145 

1704234 

1661596 

1 594147 

1597266 

1538 036 

1532913 

1544214 

egl-g2-D 

1446 680 

1516103 

1918757 

1812 309 

1 728 840 

1741351 

1 695 333 

1694448 

1704 080 

egl-g2-E 

1581459 

1701681 

1998 355 

1962802 

1 883 953 

1908 339 

1851436 

1861 134 

1 861469 



(a) ob-egl-g2-E 



Figure 5.1: Three instances from the Ob benchmark set. 
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Since our algorithm outperforms many other polynomial-time 
heuristics, it is useful for computing good solutions in instances 
that are still too large to be attacked by exact, local search, or 
genetic algorithms. Moreover, it might be useful to use our 
solution as initial solution for local search algorithms. 

Our theoretical results show that one should not evaluate 
polynomial-time heuristics only on instances whose positive- 
demand arcs induce a graph with only one connected component, 
because the solution quality achievable in polynomial time is 
largely determined by this number of connected components. 
Therefore, it would be interesting to see how other polynomial¬ 
time heuristics, which do not take into account the number of 
connected components in the graph induced by the positive- 
demand arcs, compare to our algorithm in instances where this 
number is larger than one. 

Finally, we conclude with a theoretical question: It is easy 
to show a 3-approximation for the Mixed Chinese Postman 
problem using the approach in Section 3.3, yet Raghavachari 
and Veerasamy [37] showed a 3/2-approximation. Can our 
(a(C) + 3)-approximation for MWRPP in Theorem 1.7(ii) be 
improved to an (a(C) -i- 3/2)-approximation analogously? 
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